High GC Content Causes De Novo Created Proteins to be Intrinsically Disordered

نویسندگان

  • Walter Basile
  • Oxana Sachenkova
  • Sara Light
  • Arne Elofsson
چکیده

De novo creation of protein coding genes involves formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. De novo created proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not cause aggregation. Therefore, although the creation of the short ORFs could be truly random, but the fixation should be of subject to some selective pressure. The selective forces acting on de novo created proteins have been elusive and contradictory results have been reported. In Drosophila they are more disordered, i.e. are enriched in polar residues, than ancient proteins, while the opposite trend is present in yeast. To the best of our knowledge no valid explanation for this difference has been proposed. To solve this riddle we studied structural properties and age of all proteins in 187 eukaryotic species. We find that, on average, there are small differences between proteins of different ages, with the exception that younger proteins are shorter. However, when we take the GC content into account we find that this can explain the opposite trends observed in yeast (low GC) and drosophila (high GC). GC content is correlated with codons coding for disorder-promoting amino acids, and inversely correlated with transmembrane, helix and sheet promoting residues. We find that for the youngest proteins, i.e. the ones that are most likely to be de novo created, there exists a strong correlation with GC and structural properties. In contrast, this strong relationship is not seen for ancient proteins. This leads us to propose that structural features are not a strong determining factor for fixation of de novo created genes. Instead these proteins resemble random proteins given a particular GC level. The dependency on GC content is then gradually weakened during evolution.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High GC content causes orphan proteins to be intrinsically disordered

De novo creation of protein coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. These orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not aggregate. Therefore, although the creation of short ORFs could be truly random, the fixation shoul...

متن کامل

Comparison of Structure Determination Methods for Intrinsically Disordered Amyloid-β Peptides

Intrinsically disordered proteins (IDPs) represent a new frontier in structural biology since the primary characteristic of IDPs is that structures need to be characterized as diverse ensembles of conformational substates. We compare two general but very different ways of combining NMR spectroscopy with theoretical methods to derive structural ensembles for the disease IDPs amyloid-β 1-40 and a...

متن کامل

Intrinsically Disordered Proteins in a Physics-Based World

Intrinsically disordered proteins (IDPs) are a newly recognized class of functional proteins that rely on a lack of stable structure for function. They are highly prevalent in biology, play fundamental roles, and are extensively involved in human diseases. For signaling and regulation, IDPs often fold into stable structures upon binding to specific targets. The mechanisms of these coupled bindi...

متن کامل

Unfoldomics of human genetic diseases: illustrative examples of ordered and intrinsically disordered members of the human diseasome.

Intrinsically disordered proteins (IDPs) constitute a recently recognized realm of atypical biologically active proteins that lack stable structure under physiological conditions, but are commonly involved in such crucial cellular processes as regulation, recognition, signaling and control. IDPs are very common among proteins associated with various diseases. Recently, we performed a systematic...

متن کامل

Chemical perturbation of an intrinsically disordered region of TFIID distinguishes two modes of transcription initiation

Intrinsically disordered proteins/regions (IDPs/IDRs) are proteins or peptide segments that fail to form stable 3-dimensional structures in the absence of partner proteins. They are abundant in eukaryotic proteomes and are often associated with human diseases, but their biological functions have been elusive to study. In this study, we report the identification of a tin(IV) oxochloride-derived ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016